Threads and Processes
A thread is like a mini-process within another process
- Threads are executed independently from (and in parallel with) the rest of the process
- Each thread has its own instruction pointer and stack space
- But it shares code and data (variables) with other threads in the same process
Threads can be managed as a higher-level construct within a process; the kernel does not know these threads exist
In most systems, the kernel is not involved in context switching of individual threads
- Handled at a higher level within the process that the threads belong to
- Java threads are handled by the Java virtual machine (JVM) during its own time slice
- These are known as user-level threads
- Important mechanism for event-driven programs (especially UIs)
Threads
A thread can be seen as a lightweight process within a normal heavyweight process
It will usually serve a specific purpose that helps the main process to manage its tasks
Consider a web browser
- One thread for retrieving data from the internet
- Another thread to render the page within a window
- Another thread to wait for keypresses and clicks in the user interface
Consider a word processor
- One thread handles the rendering of text on screen
- Another handles keypresses and mouse clicks
- Another performs spellchecking in the background
- Another performs grammar checking
Benefits
- Modularity - Easier to create one thread for each distinct activity instead of trying to do everything in one linear piece of code
- Responsiveness - A multithreaded application can continue to run even if one of the threads is blocked
- Resource sharing - Threads can share memory and resources of the process they belong to (Concurrency needs to be handled properly)
- Economy - Threads are easier to create and switch between because they are allocated and managed within a process that already exists
- Parallel execution - Each thread can be physically running at the same time if the CPU has multiple cores (concurrent programming brings other issues)
Concurrent Programming
A single thread would perform all the steps in a calculation in a sequence (usually based on the mathematical order of precedence)
In a multi-threaded (or multi-core) system, this can be split into two types of operation
- Concurrent operations can be executed in parallel (independently)
- Serial operations rely on the results of earlier operations
For example, consider the quadratic formula x = (-b + sqrt(b^2 - 4ac)) / (2a)
Concurrent: t1 = -b, t2 = b*b, t3 = 4*a, t4 = 2*a
Serial: t5 = t3*c, t5 = t2 - t5, t5 = sqrt(t5), t5 = t1 + t5, x = t5 / t4
Processing time is reduced from 9 units to 6 units as the concurrent operations can be done simultaneously
A good compiler will identify concurrent instructions and assign to different CPU cores
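As a rough illustration (not from the notes), the four independent operations could be handed to worker threads and the serial steps performed once they have finished; the class and variable names below are made up for the sketch:
public class QuadraticDemo {
    // results of the concurrent (independent) operations
    static double t1, t2, t3, t4;

    public static void main(String[] args) throws InterruptedException {
        double a = 1, b = -3, c = 2;   // example coefficients

        // Concurrent part: each temporary depends only on the inputs
        Thread w1 = new Thread(() -> { t1 = -b; t2 = b * b; });
        Thread w2 = new Thread(() -> { t3 = 4 * a; t4 = 2 * a; });
        w1.start();
        w2.start();
        w1.join();   // wait for both workers to finish
        w2.join();

        // Serial part: each step depends on an earlier result
        double t5 = t3 * c;
        t5 = t2 - t5;
        t5 = Math.sqrt(t5);
        t5 = t1 + t5;
        double x = t5 / t4;
        System.out.println("x = " + x);   // prints x = 2.0 for these coefficients
    }
}
In practice the cost of creating and joining threads here far outweighs the saving; the point is only to show which operations are independent of one another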
Java Threads
Each Java program runs in its own heavyweight process
- The main program code runs in one lightweight thread
- Java virtual machine (JVM) starts other threads for garbage collection, event handling, screen rendering, etc.
The programmer can spawn new threads within their code by extending the Thread class
public class Thread extends Object implements Runnable
The Runnable interface defines a run() method that the programmer must implement
- This code does not start executing until the main program requests it
- Create an instance of the thread then call its start() method
Threads are managed internally by the JVM
- When the JVM gets time on the CPU, it decides which thread to run (including its own)
- So it has its own internal thread scheduling algorithm (similar to round robin)
Example
class MyWorker extends Thread {
public void run() {
System.out.println("I am a worker thread");
}
}
public class MyMain {
public static void main (String[] args) {
MyWorker runner = new MyWorker();
runner.start();
System.out.println("I am the main thread");
}
}
Threads don't start running until explicitly instructed to
You don't know which string output (main or worker) will appear first, as it depends on thread scheduling (in practice the main thread's output tends to appear first because the worker thread takes time to start, but there is no guarantee)
The MyWorker class extends the Thread class
- Thread already implements the Runnable interface
- The programmer overrides the run() method and puts the thread's code in it
In the main program:
- Create a new object of type MyWorker
- Call its start() method
Within the JVM:
- Memory is allocated for the new thread object
- Its run() method is called and begins execution
Now both the main program thread and the child thread will be running in parallel
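Because Thread itself implements Runnable, an equivalent approach (a sketch, not from the notes; class names are made up) is to implement Runnable directly and hand the object to a Thread:
class MyWorkerTask implements Runnable {
    public void run() {
        System.out.println("I am a worker thread");
    }
}

public class MyMainRunnable {
    public static void main(String[] args) {
        Thread runner = new Thread(new MyWorkerTask());
        runner.start();   // the JVM calls run() on the new thread
        System.out.println("I am the main thread");
    }
}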
Java Thread States
Each thread begins in the new state when its object is allocated memory
- It moves to the runnable state when its start() method is called
- It moves to the blocked state when its sleep() method is called, when it is waiting for I/O, or when it is waiting for another thread to finish; afterwards it moves back to the runnable state
- It moves to the dead state when it has terminated and is waiting for the JVM to clean up its memory via garbage collection
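The JVM exposes these states through Thread.getState(), although it uses more detailed names such as TIMED_WAITING and TERMINATED; a small sketch (class name made up) that observes a worker before it starts, while it sleeps, and after it has finished:
public class StateDemo {
    public static void main(String[] args) throws InterruptedException {
        Thread worker = new Thread(() -> {
            try {
                Thread.sleep(500);              // the worker blocks itself
            } catch (InterruptedException e) { }
        });

        System.out.println(worker.getState()); // NEW: object allocated, not started
        worker.start();
        Thread.sleep(100);                      // give the worker time to reach sleep()
        System.out.println(worker.getState()); // usually TIMED_WAITING: blocked in sleep()
        worker.join();                          // wait for the worker to terminate
        System.out.println(worker.getState()); // TERMINATED: the 'dead' state
    }
}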
Synchronisation Problems
class TwoChar extends Thread {
private char out1, out2;
public TwoChar(char first, char second) {
out1 = first;
out2 = second;
}
public void run() {
System.out.print(out1);
System.out.print(out2);
}
}
This thread object has a constructor that sets two internal character variables
When the thread runs, it will output each character one after the other
If we have a main program like so:
public class ThreadExample {
public static void main(String[] args) {
TwoChar tc1 = new TwoChar('A', 'B');
TwoChar tc2 = new TwoChar('1', '2');
tc1.start();
tc2.start();
}
}
There will be different results when this is run, depending on the JVM thread scheduler
Possible outputs include AB12, A1B2, A12B, 1A2B, 1AB2 and 12AB
AB12 will probably appear most often as this is the order the calls appear in the program code, however there is no guarantee about the execution order of threads
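If a particular ordering is required, the main thread can wait for one worker to finish before starting the next, for example with join(); a minimal sketch reusing the TwoChar class above:
public class OrderedExample {
    public static void main(String[] args) throws InterruptedException {
        TwoChar tc1 = new TwoChar('A', 'B');
        TwoChar tc2 = new TwoChar('1', '2');
        tc1.start();
        tc1.join();   // block the main thread until tc1 has finished
        tc2.start();
        tc2.join();
        // The output is now always AB12, at the cost of losing all parallelism
    }
}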
Shared Variables
Suppose there is an object that can be shared by multiple threads
class Something {
private int num = 0;
public void increase() {
num++;
}
}
This object is passed into the constructor of two threads
Something thing = new Something();
MyThread t1 = new MyThread(thing);
MyThread t2 = new MyThread(thing);
At some point during program execution, both threads call thing.increase() on the shared object
We cannot guarantee that both increments will take effect (due to thread scheduling)
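The MyThread class itself is not shown in the notes; a minimal sketch of what it might look like, assuming it simply calls increase() on the shared object when it runs:
class MyThread extends Thread {
    private Something shared;      // the object shared with other threads

    public MyThread(Something s) {
        shared = s;
    }

    public void run() {
        shared.increase();         // both threads update the same counter
    }
}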
Race Conditions
Consider what happens at the register (assembly) level when the increase happens
mov eax, num
inc eax
mov num, eax
This works perfectly fine most of the time, but occasionally the instructions from the two threads are interleaved in a way that causes a problem
If the starting value is 0, we would expect it to be 2 after both threads have executed
However, there is a chance that, in the middle of carrying out these instructions to increment num, thread 1 runs out of its time slice, which would cause something like this:
| T1 | T2 | num |
|---|---|---|
| mov eax, num | | 0 |
| inc eax | | 0 |
| | mov eax, num | 0 |
| | inc eax | 0 |
| | mov num, eax | 1 |
| mov num, eax | | 1 |

Here num is only incremented once: T1 writes the same stale value from its register into num as T2 did, because it was cut off during its first burst
This is known as a race condition, and it is very hard to debug or even to notice that it has happened
It occurs when some area of memory is not written or updated correctly because a thread was interrupted at just the wrong time
If the code is for something important and vital, e.g. a rocket, you do not want these race conditions, as the code must work as intended 100% of the time
Race conditions involve some (even if just a little) uncertainty
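A small experiment (not from the notes; class names are made up) usually exposes the lost update by repeating the unsynchronised increment many times in two threads:
class Counter {
    private int num = 0;                // shared, unsynchronised counter
    public void increase() { num++; }   // read-modify-write: not atomic
    public int get() { return num; }
}

class IncThread extends Thread {
    private Counter counter;
    public IncThread(Counter c) { counter = c; }
    public void run() {
        for (int i = 0; i < 1000000; i++) {
            counter.increase();
        }
    }
}

public class RaceDemo {
    public static void main(String[] args) throws InterruptedException {
        Counter c = new Counter();
        IncThread t1 = new IncThread(c);
        IncThread t2 = new IncThread(c);
        t1.start();
        t2.start();
        t1.join();   // wait for both threads to finish
        t2.join();
        // Expected 2000000, but the printed value is usually lower (lost updates)
        System.out.println("num = " + c.get());
    }
}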
Critical Regions
A race condition could happen whenever a variable is shared between multiple threads
- Most of the time it won't happen because the threads are uninterrupted
- The problem can't be ignored because sometimes it will happen
Whenever a process or thread is accessing a shared resource, its code is said to be in a critical region
It must be ensured that only one thread (or process) can access the resource at any one time
- The programmer must somehow 'lock' the resource so the thread gets exclusive access (see the sketch after this list)
- Other threads must wait for the resource to be unlocked
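In Java, one common way to achieve this (not covered in the notes) is the synchronized keyword, which allows only one thread at a time to execute the method on a given object; a minimal sketch applied to the Something class from earlier:
class Something {
    private int num = 0;

    // synchronized: a thread must hold the object's lock to run this method,
    // so the increment can no longer be interleaved with another thread's
    public synchronized void increase() {
        num++;
    }

    public synchronized int get() {
        return num;
    }
}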
Semaphores
The concept of semaphores (or tokens) comes from single track railways
- A train can only enter a single track if the driver picks up a physical token
- Only one token exists, so only one train can be on the track
- Driver leaves token at the other end when they have passed the single track
Programming semaphores were proposed by Dijkstra in the 1960s
- A critical region of code is like a single-track railway
- Thread must acquire the semaphore before it can enter the region
- Other threads must wait (block) until the semaphore becomes available
Use of semaphores requires careful programming
- Must consider how each shared resource will be used by each thread
- Critical regions must be kept as small as possible to reduce thread blocking
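Java provides a counting semaphore in java.util.concurrent; a minimal sketch (not from the notes; the class name is made up) that guards the increase() critical region with a binary semaphore (one permit, i.e. one token):
import java.util.concurrent.Semaphore;

class SharedCounter {
    private int num = 0;
    private Semaphore token = new Semaphore(1);   // only a single token exists

    public void increase() throws InterruptedException {
        token.acquire();      // wait until the token is available
        try {
            num++;            // critical region: exclusive access to num
        } finally {
            token.release();  // always hand the token back
        }
    }
}
Keeping the acquire/release pair tightly around the increment keeps the critical region as small as possible, as the final bullet point above recommends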